The Atlas of Living Australia data for the area around Monash’s Clayton campus, Melbourne has been downloaded for you.
monash <- read_csv("data/monash_species.csv")
Event Date - parsed
Min. :1898-07-14
1st Qu.:2014-08-07
Median :2016-04-27
Mean :2014-12-17
3rd Qu.:2018-04-07
Max. :2018-08-30
NA's :5
316# A tibble: 316 x 2
`Scientific Name` n
<chr> <int>
1 Cracticus tibicen 861
2 Manorina (Myzantha) melanocephala 704
3 Trichoglossus haematodus 630
4 Glossopsitta concinna 561
5 Grallina cyanoleuca 533
6 Corvus mellori 518
7 Anthochaera carunculata 500
8 Gallinula (Gallinula) tenebrosa tenebrosa 361
9 Hirundo neoxena 332
10 Anas (Anas) superciliosa 330
# ... with 306 more rows
Cracticus tibicen which is the Australian magpie.
It’s a bit surprising to see sightings dated in the 1800s. Let’s look at these in more detail.
# A tibble: 10 x 2
`Vernacular name` n
<chr> <int>
1 Australasian Bittern 1
2 Australian Pipit 1
3 Budgerigar 1
4 Fork-tailed swift 1
5 Regent Honeyeater 1
6 Shining Bronze-cuckoo 4
7 White-browed Woodswallow 1
8 White-winged Triller 1
9 Yellow-tail 1
10 <NA> 1
Subset the data to species seen after 1950. Count the number of sightings of each species. Randomly sample 4 from the ones that have been sighted at least 100 times. List the Scientific names of your four species.
myspecies <- monash %>%
filter(year(`Event Date - parsed`) >= 1950) %>%
count(`Scientific Name`, sort=TRUE) %>%
filter(n > 100) %>%
sample_n(4)
mysample <- monash %>%
filter(year(`Event Date - parsed`) >= 1950) %>%
filter(`Scientific Name` %in% myspecies$`Scientific Name`)
# My four
> myspecies
# A tibble: 4 x 2
`Scientific Name` n
<chr> <int>
1 Anser 175
2 Cracticus torquatus 307
3 Cracticus tibicen 861
4 Streptopelia (Spilopelia) chinensis 302
The magpie is spotted more in nesting season, Aug-Nov, but the other three of mine are seen routinely throughout the year.
During the day, starting around 9-10, tapering off in the afternoon. This is most likely following human patterns, that the species are reported during the work day.
We can learn a lot about the species based on the differentiated keywords. For example, “Anser” are geese, and are likely an introduced species because “America” pops up. Similarly, the spotted dove is an introduced species because India and Hawaii pop up. Magpies could be dangerous (as we know) because the words “attack” and “aggressive” pop out. And the bucherbird is known for its “singing”.
Two points reserved for easy to compile, spell-checked, nicely turned in work.